An Extended Home-Based Coherence Protocol for Causally Consistent Replicated Read-Write Objects
Abstract
This paper considers the reliability of software Distributed Shared Memory (DSM) systems in which the unit of sharing is a persistent read-write object. We present an extended coherence protocol for the causal consistency model that integrates replication management with independent checkpointing. It uses a novel coordinated burst checkpoint operation to replicate consistent checkpoints of shared objects in the local memory of distinct system nodes. No special reliable hardware devices are required. The protocol offers high availability of shared objects with limited overhead and ensures fast recovery in case of multiple node failures. In case of network partitioning, all processes in a majority partition of the system can continue to access all objects.
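The majority-partition rule in the last sentence can be illustrated with a minimal sketch. This is not the protocol from the paper: the function name `in_majority_partition` and the node identifiers are hypothetical, and the sketch assumes that "majority" means strictly more than half of all system nodes.

```python
from typing import AbstractSet


def in_majority_partition(reachable: AbstractSet[str],
                          all_nodes: AbstractSet[str]) -> bool:
    """Return True when the reachable nodes form a strict majority.

    Assumption (not taken from the paper): a partition counts as the
    majority partition when it holds more than half of all system nodes.
    """
    # Ignore identifiers that are not part of the known system.
    members = reachable & all_nodes
    return 2 * len(members) > len(all_nodes)


# Hypothetical five-node system split into partitions of three and two nodes.
nodes = {"n1", "n2", "n3", "n4", "n5"}
larger_side = in_majority_partition({"n1", "n2", "n3"}, nodes)
smaller_side = in_majority_partition({"n4", "n5"}, nodes)
```

Under this assumption, at most one partition can pass the check at any time, so processes on the larger side could keep accessing objects while those on the smaller side would have to wait until the partition heals.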
Similar resources
An Extended Coherence Protocol for Recoverable DSM Systems with Causal Consistency
This paper presents a coherence protocol for recoverable Distributed Shared Memory (DSM) systems with causally consistent read-write objects. It uses independent checkpointing tightly integrated with coherence operations. That integration results in high availability of shared objects and ensures fast restoration of the consistent state of DSM in spite of multiple node failures, introducing lit...
Group Protocol for Distributed Replicated Objects
In group protocols, larger computation and communication are consumed to causally order all messages transmitted in the network. Transactions in clients manipulate objects in servers by sending read and write requests to the servers. In this paper, we define significant messages, which are to be ordered at the application level, by using a conflicting relation among the transactions. We newly p...
Object-Based Locking Protocol for Replicated Objects
Distributed systems are composed of multiple objects. Each object supports more abstract operations than the low-level read and write operations. The objects are replicated to increase performance, reliability, and availability. In this paper, we discuss a synchronization method to make multiple replicas of objects mutually consistent. In the traditional optimistic two-phase locking (O2PL), ...
Transaction-Based Causality in Distributed Replicated Objects
In distributed applications, multiple objects cooperate by exchanging messages with each other. Causally ordering all the messages transmitted in the network implies large computation and communication overhead. Transactions in clients manipulate objects in servers by sending read and write requests to the servers. In this paper, we define which messages are to be causally ordered based on ...
An Extended Atomic Consistency Protocol for Recoverable DSM Systems
This paper describes a new checkpoint recovery protocol for Distributed Shared Memory (DSM) systems with read-write objects. It is based on independent checkpointing integrated with a coherence protocol for the atomic consistency model. The protocol offers high availability of shared objects in spite of multiple node and communication failures, introducing little overhead. It ensures fast recov...